The Last-Step Minimax Algorithm

نویسندگان

Eiji Takimoto

Manfred K. Warmuth

چکیده

We consider on-line density estimation with a parameterized density from an exponential family. In each trial t the learner predicts a parameter t. Then it receives an instance xt chosen by the adversary and incurs loss ln p(xtj t) which is the negative log-likelihood of xt w.r.t. the predicted density of the learner. The performance of the learner is measured by the regret de ned as the total loss of the learner minus the total loss of the best parameter chosen o -line. We develop an algorithm called the Last-step Minimax Algorithm that predicts with the minimax optimal parameter assuming that the current trial is the last one. For one-dimensional exponential families, we give an explicit form of the prediction of the Last-step Minimax Algorithm and show that its regret is O(lnT ), where T is the number of trials. In particular, for Bernoulli density estimation the Last-step Minimax Algorithm is slightly better than the standard Krichevsky-Tro mov probability estimator.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Strategy to Enforce the Discrete Minimax Principle on Finite Element Meshes

Low quality meshes in threedimensional finite element diffusion simulations often violate the discrete Minimax principle. An indication of this shortcoming is the occurrence of negative concentrations. We present an a posteriori refinement algorithm to enforce the discrete Minimax principle by locally refining the mesh based on the last time-step. Two examples of diffusion simulations where neg...

متن کامل

Convergence Results of a Local Minimax Method for Finding Multiple Critical Points

In [14], a new local minimax method that characterizes a saddle point as a solution to a local minimax problem is established. Based on the local characterization, a numerical minimax algorithm is designed for finding multiple saddle points. Numerical computations of many examples in semilinear elliptic PDE have been successfully carried out to solve for multiple solutions. One of the important...

متن کامل

Truncated Linear Minimax Estimator of a Power of the Scale Parameter in a Lower- Bounded Parameter Space

 Minimax estimation problems with restricted parameter space reached increasing interest within the last two decades Some authors derived minimax and admissible estimators of bounded parameters under squared error loss and scale invariant squared error loss In some truncated estimation problems the most natural estimator to be considered is the truncated version of a classic...

متن کامل

Random Permutation Online Isotonic Regression

FORWARD ALGORITHMS Two observations: • PAVA is efficient and generalizes to partial orders • Follow The Leader algorithms are common in practice Forward Algorithm: To predict at xt, imagine y′ t ∈ [0, 1], compute f∗ on {(x1, y1) . . . (xt−1, yt−1)} ∪ {(xt, y′ t)}, and play ŷt = f(xt). FORWARD ALGORITHM EXAMPLES • IR-Int: Compute f∗ on past data. Predict with average of f∗ at nearest xi. • Inter...

متن کامل

Achievability of Asymptotic Minimax Regret in Online and Batch Prediction

The normalized maximum likelihood model achieves the minimax coding (log-loss) regret for data of fixed sample size n. However, it is a batch strategy, i.e., it requires that n be known in advance. Furthermore, it is computationally infeasible for most statistical models, and several computationally feasible alternative strategies have been devised. We characterize the achievability of asymptot...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2000

The Last-Step Minimax Algorithm

نویسندگان

چکیده

منابع مشابه

A Strategy to Enforce the Discrete Minimax Principle on Finite Element Meshes

Convergence Results of a Local Minimax Method for Finding Multiple Critical Points

Truncated Linear Minimax Estimator of a Power of the Scale Parameter in a Lower- Bounded Parameter Space

Random Permutation Online Isotonic Regression

Achievability of Asymptotic Minimax Regret in Online and Batch Prediction

عنوان ژورنال:

اشتراک گذاری